Replication package for "Getting Out the (Newly-Enfranchised) Vote:
Encouraging voter registration after rights restoration" (Ariel White, Hannah Walker, Melissa Michelson, Sam Roth)
May 2025
Email Ariel with questions: arwhi@mit.edu


This replication package includes deidentified experimental data and analysis code that can be used to reproduce all tables and figures in the paper and SI. We also include the code originally used to produce the deidentified dataset, although it cannot be run without the personally identifying data collected from New Jersey (NJ), which we do not share. We have sought to balance the reproducibility of the research against the privacy of the people in the sample. Because the raw datasets used in the experiment include names, dates of birth, and addresses, we do not provide them. As a result, users of this replication package cannot rerun the merge code that links individuals to voting records, but they can read that code to understand and evaluate the merge approach.

In this file, we describe the contents of the replication package. 

DATA:
- "NJISJ_fall2021exp_deid.csv" is a deidentified individual-level dataset produced by merging the main experimental list with several snapshots of the NJ voter file, as described in the SI. This is the dataset used to produce all tables and figures in the paper and SI.

CODE:
- "record_linkage_final.R" is the script used to merge the raw experimental data with snapshots of the NJ voter file. It cannot be run without both the personally identifying experimental data and the NJ voter file, neither of which is included in this replication package. We include it so that readers can understand and evaluate the merge approach used in the paper and see how the deidentified dataset "NJISJ_fall2021exp_deid.csv" was produced.
- "NJISJ_2021exp_analysis.R" is the main script used to produce the figures and tables in the paper and SI. It reads in the deidentified dataset "NJISJ_fall2021exp_deid.csv" and runs the analyses reported in the paper.

OUTPUT:
- This replication package also contains all the tables and figures produced by "NJISJ_2021exp_analysis.R". We have included them in the replication package so that they can be examined even without running the code, but running the script will regenerate them. 
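
EXAMPLE:
The following is a minimal sketch of how the analysis might be rerun from an R session. It is not part of the package itself. It assumes that the data file and the analysis script sit together in the current working directory; the script may set its own paths or require packages that are not listed in this README, so adjust as needed.

```r
# Sketch only: assumes both files are in the current working directory.
# The analysis script may expect a different layout or need extra packages.
data_file <- "NJISJ_fall2021exp_deid.csv"
analysis_script <- "NJISJ_2021exp_analysis.R"

if (file.exists(data_file)) {
  # Take a quick look at the deidentified data before running anything.
  dat <- read.csv(data_file, stringsAsFactors = FALSE)
  str(dat)
} else {
  message("Data file not found in ", getwd())
}

if (file.exists(data_file) && file.exists(analysis_script)) {
  # Rerun the analysis; per the README this regenerates the tables and figures.
  source(analysis_script, echo = TRUE)
} else {
  message("Analysis not run: data file or script missing from ", getwd())
}
```

The existence checks are only there so that the sketch fails gently if run from the wrong directory.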

